Malware detection verification and enhancement by coordinating endpoint and malware detection systems

ABSTRACT

Computerized techniques to determine and verify maliciousness of an object by a security logic engine are described. A method features receiving information pertaining to a first set of events associated with a first object (first information) from an endpoint and information pertaining to a second set of events associated with a second object (second information) from an analysis system. Thereafter, the likelihood of the cyber-attack being conducted on the network is determined by at least correlating the first information and the second information with at least events associated with known malicious objects. Any endpoint vulnerable to the cyber-attack are identified based on a configuration of each of the plurality of endpoints and requesting the analysis system to conduct one or more further analyses in accordance with at least a software profile identified in a configuration of the first endpoint of the plurality of endpoints identified as vulnerable.

RELATED APPLICATION

This application is a continuation of U.S. patent application Ser. No. 15/633,226, filed Jun. 26, 2017, now U.S. Pat. No. 10,462,173, issued Oct. 29, 2019, which claims priority from commonly owned Provisional Patent Application No. 62/357,119, entitled MALWARE DETECTION VERIFICATION AND ENHANCEMENT BY COORDINATING ENDPOINT AND MALWARE DETECTION SYSTEMS, by Ashar Aziz et al., filed on Jun. 30, 2016 the contents of which are incorporated herein by reference.

FIELD OF THE INVENTION

The present invention relates generally to cyber security and more particularly to verifying and enhancing the detection of a cyber-attack on a network.

BACKGROUND OF THE INVENTION

A cyber-attack may employ malware (malicious software), which may include a computer program or file that is harmful to a computer, computer network, and/or user. Conventional antivirus applications may be employed at computers, such as, for example, laptops and servers connectable as nodes (e.g., endpoints) of a network, to identify viruses and other malware using a signature-based approach. Antivirus applications identify malware using an antivirus engine that compares the contents of a file to a database of known malware signatures. Advanced malware often avoids detection by antivirus applications. Advanced malware is often polymorphic in nature, that is, changes its “fingerprint” while maintaining its central malicious functionality, thus avoiding matches against the signature database. Also, advanced malware is often custom-designed for use against targeted users, organizations or industries and not re-used against other targets. As such, targeted malware will often not match signatures of known generic malware. Given that advanced malware is able to circumvent conventional anti-virus analysis, this approach has been determined to be deficient.

Another solution employs a malware detection system to identify malware at the network periphery. In some solutions, detection at the network periphery may utilize a conventional network intrusion detection system (IDS) often incorporated into network firewalls to compare signatures of known malware against traffic for matches while, in other solutions, a two-phase network security appliance (NSA) may be employed. The two-phase approach may compare in-bound network traffic against known characteristics of malware in a static analysis phase and identify malicious behaviors during execution of the content in a dynamic analysis phase.

Detection at the network periphery may be limited by the capability of the malware detection system for precise and effective detection without excessive false positives (wrongly identified attacks) on the one hand (such as is often the case with IDSs), and for timely analysis of behaviors of the network traffic to prevent network intrusion on the other (such as may be the case with some NSAs pending completion of their analysis). Furthermore, the analysis at the network periphery may not provide sufficient information about the particular target or targets (e.g., endpoints) within the network and the potential scope and severity of the attack.

Moreover, the proliferation of malware detection systems and security software has inundated network administrators with security alerts. Actionable intelligence may be buried within these security alerts; however, the sheer number of the alerts makes it difficult for network administrators to identify high priority alerts, a situation exacerbated by the presence of false positives. Moreover, the alerts may not contain sufficient information regarding the progression of the attack once inside the network. Accordingly, a network manager may be unable to identify whether a cyber-attack is in progress or has even occurred and to determine appropriate and timely actions to contain and remediate potential damage.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and further advantages of the embodiments herein may be better understood by referring to the following description in conjunction with the accompanying drawings in which like reference numerals indicate identically or functionally similar elements, of which:

FIG. 1 is a block diagram of a network environment in accordance with one or more embodiments described herein;

FIG. 2 is a block diagram of an endpoint device architecture in accordance with one or more embodiments described herein;

FIG. 3 is a block diagram of a malware detection system (MDS) in accordance with one or more embodiments described herein;

FIG. 4 is a block diagram of a security logic engine in accordance with one or more embodiments described herein;

FIG. 5 is a flowchart of an exemplary method for enhancing endpoint cyber-attack detection in accordance with one or more embodiments described herein;

FIG. 6 is a flowchart of an exemplary method to configure improved techniques for enhanced endpoint cyber-attack detection in accordance with one or more embodiments described herein; and

FIG. 7 is a block diagram of an interoperational malware detection and verification system in accordance with one or more embodiments described herein.

DETAILED DESCRIPTION OF THE INVENTION Overview

Aspects of the invention reside in the interoperation of a network endpoint and a network connected malware detection system for the detection and/or verification of malware threats to mitigate or prevent data theft, operational compromise and other cyber-attack effects. The network endpoint (“endpoint”) and malware detection system (“MDS”) may coordinate and enhance their respective detection capabilities and even block malware as well as predict future victims. An endpoint (e.g., a computing device connectable to a network, such as a laptop, desktop, server, etc.) may detect behaviors of an object or a suspicious object during the endpoint's normal operation, and trigger an MDS to process the object for further detection and analysis. The results from the endpoint and MDS are combined and correlated to classify the object as malicious or benign. This technique may be used to automatically determine, without human intervention, whether the network is under a cyber-attack, evaluate the scope and target(s) of the cyber-attack, assess the risk to the network, identify and trace the path of the cyber-attack on the network, and recognize polymorphic malware used in an attack.

An endpoint may be configured to monitor the behaviors of an object processed by the endpoint. The behaviors may be indicative of malware. In an embodiment, an endpoint processes an object (e.g., processing of a document file, etc.), the endpoint collects information related to the behaviors (“events”), and communicates the events to a security logic engine (“SLE”). In one embodiment, a software-based agent installed on the endpoint may monitor, collect and store the events, and, in some embodiments, classify the collected events as anomalous (i.e., represent unexpected, undesired, or unwanted behaviors) or otherwise suspicious (i.e., associated with potential or actual cyber-attacks), depending on the embodiment. The endpoint may communicate all monitored events or only suspicious events to the SLE for further correlation and classification of the object. The communications from the endpoint may trigger further analysis of the object by the MDS, as described below.

The SLE may determine or verify a cyber-attack by combining the analysis results of the endpoint and MDS. In some embodiments, a SLE may coordinate analyses of the object by the MDS in response to information received by the SLE from the endpoint, to obtain enhanced analysis results and make or verify a determination of maliciousness. The SLE may perform correlation of the respective analysis results and classification of the object as malicious or benign. The SLE may initiate and coordinate additional analyses of the suspicious object to identify other endpoints on the network that may be vulnerable to the verified malicious object by directing processing of the object by the MDS or endpoint(s) to collect further features related to the processing of the object. These endpoints may not yet be affected by the malware or the effects of the malware may not have been observed, as yet. By determining whether the malware may affect currently unaffected endpoints, the SLE may determine the scope of the threat, the target of the threat, and/or “predict” new victims susceptible to the malicious object.

The MDS may be contained within a special-purpose, dedicated malware detection appliance or a component in a general purpose computing device. As used herein, an appliance may be embodied as any type of general-purpose or special-purpose computer, including a dedicated electronic computing device, adapted to implement a variety of software architectures relating to exploit and malware detection functionality. The term “appliance” should therefore be taken broadly to include such arrangements, in addition to any systems or subsystems configured to perform a management function for exploit and malware detection, and associated with other equipment or systems, such as a network computing device interconnecting the WANs and LANs. The MDS may be available via a local network connection or remotely through the internet. The malware detection system may include a static analysis engine that may identify suspicious or malicious characteristics of an object, statically (operable without executing the object). Additionally, the MDS may utilize a dynamic analysis logic to process suspicious objects in an instrumented (i.e., monitored), virtual machine capable of detecting behaviors of the suspicious objects during processing. The dynamic analysis logic may be configured with (and run) an operating system and one or more applications (collectively, the “software profile”) that the suspicious object may expect or need for effective processing, and the software profile may include the same type of software run on the endpoint. By so doing, the software environment in which the endpoint monitored the suspicious behaviors may be replicated in the software profile run on the MDS. In this way, behaviors that may be exhibited only in the presence of those applications will be detected. The SLE (which may be a component of the MDS in some embodiments) may combine the results of the static and dynamic analyses to classify the object as malicious or benign.

During operation, for example, the MDS may receive a suspicious object from an endpoint, when connected to the network, along with information regarding the software profile of the endpoint and behavioral features identified by the endpoint (indicators of compromise or “IOC's” for short) during processing. The suspicious object received by the MDS may be processed in a virtual machine of the dynamic analysis logic, which observes the behaviors exhibited by the virtual machine during the processing of the object. In some embodiments, the dynamic analysis logic may guide the processing and monitoring of the object in response to the information received by the MDS from the endpoint (e.g., specific behaviors to monitor, specific responses to dialog boxes or requests for further user input). Therefore, the MDS may receive an information package from the endpoint indicating an object ID, IOC's and context related to the processing of the object at an endpoint, and conduct the further malware analysis accordingly on the object.

The statically detected characteristics and/or dynamically observed behaviors (collectively, “features”) of the suspicious object, during processing by the MDS, may be provided to a classification engine for classification of the object as malicious or benign. The classification engine of the MDS may generate a classification of the suspicious object based on a correlation of the features with known features of malware and benign objects. The known features of malware are determined based on heuristics and experiential knowledge of previously analyzed malware. The classification of the suspicious object may be conveyed to an SLE for verification of the determination of maliciousness by combining the dynamic analysis results (of the MDS) with the monitored behaviors received from the endpoints. Verification of the determination of maliciousness may thus include correlation of the analysis results with those associated with a known corpus of benign and malicious objects for classification of the object.

In some embodiments, the SLE may coordinate further analyses by the MDS based on additional information received from another endpoint. For example, the SLE may direct the MDS to conduct additional analyses of the object using a different software profile, that is, the software profile of the other (additional) endpoint. The SLE, in response to receiving these additional analysis results from the MDS, may make or modify the determination of maliciousness (e.g., verify the first determination of maliciousness, etc.). The SLE may also modify the determination of maliciousness based on the monitored behaviors reported by the additional endpoint, which may have the same or different software profile and may report the same or different monitored behaviors. This enhanced determination of maliciousness may be used to evaluate or modify the risk represented by the malware to endpoints on the network. For example, a malicious object is determined to affect a greater set of applications, or versions of an application, included in the software profiles of the original and additional endpoints, and thereby represent a threat to a larger set of endpoints on the network running those software profiles. For example, an initial interoperation of a first endpoint and an MDS may indicate all versions of Office applications using Windows 8.1 are susceptible to a cyber-attack by an object. This may be reported to a network administrator. Subsequently, additional information received by the SLE from a second endpoint indicates that the same applications running on Windows 10 are also susceptible to the malicious object. Accordingly, the SLE may initiate an alert or report to the effect that the determination of maliciousness is verified and expanded to include the additional software profile information.

In still other embodiments, the SLE may predict an additional set of computers on the network may be at risk from malware by identifying a threat vector which may be common to these other endpoints. For example, if on Windows 8.1 all versions of Firefox are determined to be vulnerable to a malicious object, the SLE may direct the MDS to process the suspicious object using a software profile including Windows 10 and Firefox (if that software profile is available to the MDS) to predict the maliciousness of the suspicious object to Windows 10. Similarly, the SLE may direct an endpoint configured with one or more versions of Firefox running over Windows 10 to run the same object using a closely monitored and/or sandboxed (or otherwise protected) process to detect suspicious behaviors. Based on the features detected by the endpoints and/or the MDS, the SLE may determine if Windows 10 computers of the network are vulnerable to the malicious object. In response to this determination of maliciousness, the SLE may issue a security alert, determine the priority to mitigate or repair the threat, and/or identify further systems that may be affected. By correlating the features associated with the malware, the SLE may identify endpoints on the network that may be affected by the malicious object even if no IOCs in those endpoints have yet been reported.

The combined system, using the interoperation of endpoint(s) and an MDS to coordinate the detection and/or verification of malware, and, in some cases, prediction of malware threats to a network so as to mitigate or prevent data theft, operational compromise, and cyber-attack effects. The system may leverage endpoints as a distributed network of malware behavioral sensors to communicate with an MDS on the network for further analysis and combine the results by a SLE, which may be implemented as part of or separate from (and in communication with) the MDS and endpoint(s). The SLE may utilize the features identified by the endpoints and MDS to identify or verify malware, trace a malware attack within the network, predict additional vulnerable endpoints, and enhance the ability of a security administrator to protect the network.

Description of the Embodiments

FIG. 1 is a block diagram of a network environment 100 that may be advantageously used with one or more embodiments described herein. The network environment 100 may be organized as a plurality of networks, such as a public network 110 and/or a private network 120 (e.g., an organization or enterprise network). According to this embodiment, the public network 110 and the private network 120 are communicatively coupled via network interconnects 130, and provide network connectivity and communication to intermediary computing devices 140, such as network switches, routers and/or firewalls, one or more endpoint device(s) 200, a malware detection system (MDS) 300 and a security logic engine 400.

The intermediary computing devices 140 communicate by exchanging packets or messages (i.e., network traffic) according to a predefined set of protocols, such as the Transmission Control Protocol/Internet Protocol (TCP/IP). However, it should be noted that other protocols, such as the HyperText Transfer Protocol Secure (HTTPS) for example, may be advantageously used with the inventive aspects described herein. In the case of private network 120, the intermediary computing device 140 may include a firewall or other computing device configured to limit or block certain network traffic in an attempt to protect the endpoint devices 200 from unauthorized users and attacks. The endpoint device 200 is communicatively coupled with the security logic engine 400 by the network interconnects 130, and may provide metadata monitored and stored by the endpoint device 200 to the security logic engine 400. The malware detection system 300, security logic engine 400, and optionally one or more intermediary network device 140 are similarly connected by interconnects 130.

As illustrated in FIG. 2 in greater detail, the endpoint device 200 has physical hardware including hardware processors 210, network interface(s) 220, a memory 230, a system interconnect 270, and optionally, a user interface 290. The memory 230 may contain software comprising an operating system (OS) 240, one or more applications 265, an agent 250, event processing and filtering logic 257, and, in some embodiments, an endpoint device classifier 260. The physical hardware (e.g., hardware processors 210, network interfaces(s) 220, memory 230) may be connected for communication by the system interconnect 270, such as a bus. Generally speaking, an endpoint device 200 is a network-connected electronic device, such as a general purpose personal computer, laptop, smart phone, tablet or specialized device such as point of sale (POS) terminal and server.

The hardware processor 210 is a multipurpose, programmable device that accepts digital data as input, processes the input data according to instructions stored in its memory, and provides results as output. One example of the hardware processor 210 is an Intel® microprocessor with its associated instruction set architecture, which is used as a central processing unit (CPU) of the endpoint device 200. Alternatively, the hardware processor 210 may include another type of CPU, a digital signal processor (DSP), an application specific integrated circuit (ASIC), or the like.

The network device(s) 280 may include various input/output (I/O) or peripheral devices, such as a storage device, for example. One type of storage device may include a solid state drive (SSD) embodied as a flash storage device or other non-volatile, solid-state electronic device (e.g., drives based on storage class memory components). Another type of storage device may include a hard disk drive (HDD). Each network device 280 may include one or more network ports containing the mechanical, electrical and/or signaling circuitry needed to connect the endpoint device 200 to the private network 120 to thereby facilitate communications over the system network 100. To that end, the network interface(s) 220 may be configured to transmit and/or receive messages using a variety of communication protocols including, inter alia, TCP/IP and HTTPS. The terms “or” and “and/or” as used herein are to be interpreted as inclusive or meaning any one or any combination. Therefore, “A, B or C” or “A, B and/or C” mean “any of the following: A; B; C; A and B; A and C; B and C; A, B and C.” An exception to this definition will occur only when a combination of elements, functions, steps or acts are in some way inherently mutually exclusive.

The memory 230 may include a plurality of locations that are addressable by the hardware processor 210 and the network interface(s) 220 for storing software (including software applications) and data structures associated with such software. The hardware processor 210 is adapted to manipulate the stored data structures as well as execute the stored software, which includes an operating system (OS) 240, one or more applications 265, an agent 250, and an endpoint device classifier 260.

The operating system (OS) 240 is software that manages hardware (e.g., hardware processors 210, network interface(s) 220, memory 230, network device(s) 280, etc.), software resources, and provides common services for computer programs, such as applications 265. For hardware functions such as input and output (I/O) and memory allocation, the operating system 240 acts as an intermediary between applications 265 and the computer hardware, although the application code is usually executed directly by the hardware and frequently makes system calls to an OS function or be interrupted by it.

The agent 250 is an executable software component configured to monitor the behavior of the applications 265 and/or operating system 240. The agent 250 may be configured to monitor (via monitoring logic 255), and store metadata (e.g., state information, memory accesses, process names, time stamp, etc.) associated with content executed at the endpoint device and/or behaviors (sometimes referred to as “events”) that may be associated with processing activity. Events are behaviors of an object that are exhibited by processes executed by the endpoint and are monitored by the agent 250 during the normal operation of the endpoint. Examples of these events may include information associated with a newly created process (e.g., process identifier, time of creation, originating source for creation of the new process, etc.), information about the type and location of certain data structures, information associated with an access to certain restricted port or memory address, or the like. The agent 250 may also retrieve and communicate off the endpoint device 200 to a remote electronic device such as the SLE 400 context information such as the contents of the endpoint device's memory or hard drive. Moreover, the monitoring logic 255 may be configurable so as to enable or disable the monitoring of select behaviors, activities or processes. In some embodiments, the agent 250 may include an event processing and filtering logic 257, which, for example, applies heuristics, rules or other conditions to the monitored behaviors, to identify anomalous or unexpected behaviors and determine if the object is suspicious. Notably, the endpoint may perform (implement) exploit and malware detection as background processing (i.e., minor use of endpoint resources) with user-directed data processing being implemented as its primary processing (e.g., majority use of endpoint resources). The processing and filtering logic 257, in some embodiments, may scan content being processed for matches with indicators (signatures). Also, in some embodiments, the agent 250 is configured to provide the events including the metadata to the endpoint device classifier 260 so as to classify the behaviors as suspicious or even malicious. Further information regarding an embodiment of an agent may be had with reference to U.S. Pat. No. 8,949,257 issued Feb. 3, 2015, entitled “Method and System for Collecting and Organizing Data Corresponding to an Event,” the full disclosure of which being incorporated herein by reference.

The agent 250 may receive from the security logic engine 400 and/or malware detection system 300 a communication identifying a malicious object for elevated levels of monitoring and/or identifying certain specified behaviors or processes for monitoring. The communication identifying the malicious object may, by way of example, include signatures (“fingerprint”), indicators, and/or patterns or sequences of behaviors. Elevated levels of monitoring of the suspicious object may include modifying system settings or configuring the agent 250. System setting modification may include activating additional system monitors (via the monitoring logic 255) to further observe suspicious object execution and expediting communications of detection results to the SLE.

The agent 250 may provide metadata related to the monitored behaviors to the endpoint device classifier or classification engine 260 for classification of an object, e.g., as to threat level. The threat level may be indicated by the classifier 260 in any of various ways, such as indicating the object as malicious or suspicious, where “suspicious” imports less certainty or a lower threat level than a classification of “maliciousness.” The agent 250 and classifier 260 may cooperate to analyze and classify certain observed behaviors of the object, based on monitored events, as indicative of malware. The classifier 260 may also be configured to classify the monitored behaviors as expected and unexpected/anomalous, such as memory access violations, in comparison with behaviors of known malware and known benign content as identified through the use of machine learning techniques and experiential knowledge.

In some embodiments, the agent 250 may utilize rules and heuristics to identify the anomalous behaviors of objects processed by the endpoint device 200. Examples of an anomalous behavior may include a communication-based anomaly, such as an unexpected attempt to establish a network communication, unexpected attempt to transfer data (e.g., unauthorized exfiltration of proprietary information, etc.), or an anomalous behavior may include an execution anomaly, for example, an unexpected execution of computer program code, an unexpected Application Programming Interface (API) function call, an unexpected alteration of a registry key, or the like. The endpoint device monitoring rules may be updated to improve the monitoring capability of the agent 250.

The endpoint device monitoring rules may be periodically or aperiodically updated, with the updates received by the agent 250 from the malware detection system 200 and/or the security logic engine 400. The update may include new or modified event monitoring rules and may set forth the behaviors to monitor. The monitoring logic 255 may be configured to implement the monitoring rules received by the endpoint device agent 250. For example, the agent 250 may be updated with new behavioral monitoring rules which may be provided to the monitoring logic 255, the monitoring logic configures the monitors with the monitoring rules received by the agent 250 for a certain running process or certain application 265, for example, to monitor for spawned additional processes. Alternatively, the behavioral monitoring rules may be received by the endpoint device 200 in response to a request from the endpoint device 200 to determine whether new rules are available, and in response, the new rules are downloaded by the endpoint device 200, provided to the agent 250, and used to configure the monitoring logic 255.

In some embodiments, an endpoint device 200 may include a separate user interface 290. The user interface 290 may produce a graphical or textual based representation to a user of the endpoint device 200. The user interface 290 provides the user with the ability to interact with the computer. The user interface 290 may not be present for an endpoint device that is not dedicated to a single user or does not require the interaction with a user.

Malware Detection System

Referring now to FIG. 3, the malware detection system 300 can be implemented as a network security appliance. As used herein, an appliance may be embodied as any type of general-purpose or special-purpose computer, including a dedicated computing device, adapted to implement a variety of software architectures relating to exploit and malware detection and related functionality. The term “appliance” should therefore be taken broadly to include such arrangements, in addition to any systems or subsystems configured to support such functionality, whether implemented in one or more network computing devices or other electronic devices, equipment, systems or subsystems. Generally speaking, the malware detection system 300 may be implemented as one or more network-connected electronic devices, which each includes physical hardware comprising hardware processor(s), network interface(s), a memory, a system interconnect, an optional user interface, a system interconnect, which may be arranged and organized as shown in FIG. 2. Accordingly, each of the components of the malware detection system 300 shown in FIG. 3 and described below may be implemented as one or more computer programs or modules executable on one or more processors and stored in memory.

As thus embodied, the malware detection system 300 includes a network interface(s) 310, a static analysis logic 320 comprising at least an indicator scanner 330, and a heuristics engine 335, a dynamic analysis logic 340 comprising at least a scheduler 350, a store of software profiles 355, and one or more virtual machine(s) 360, an event database and logic 362, a classifying engine 380, an indicator generator 385, and a reporting engine 390. The malware analysis may involve static, dynamic and/or an optional emulation analysis, as generally described in U.S. Pat. No. 9,223,972, the entire contents of which are incorporated by reference.

The network interface(s) 310 may receive and capture network traffic transmitted from multiple devices without appreciably affecting the performance of the private network 120 or the devices coupled to the private network 120. In one embodiment, the malware detection system 300 may capture objects contained in network traffic using the network interface(s) 310, make a copy of the objects, pass the objects to the appropriate endpoint device(s) 200 and pass the copy of the objects to the static analysis logic 320 and/or the dynamic analysis logic 340. In another embodiment, the malware detection system 300 may capture the objects using the network interface(s) 310 and pass the objects to the static analysis logic 320 and/or the dynamic analysis logic 340 for processing prior to passing the objects to the appropriate endpoint device(s) 200. In such an embodiment, sometimes called a “blocking deployment,” the objects will only be passed to the appropriate endpoint device(s) 200 (e.g., the destination of the network traffic as identified in network traffic packets) if the analysis of the objects does not indicate that the objects are associated with malicious, anomalous and/or unwanted characteristics and/or behaviors.

The network interface(s) 310 and static analysis logic 320 may be located at the periphery of the private network 120. The periphery of a private network 120 may be located at or near the interconnect(s) 130 between the private network 120 and other networks, e.g., behind a firewall (not shown) on the private network 120. For example, the network interface(s) 310 and static analysis logic 320 components of the malware detection system are located at the private network periphery while the dynamic analysis logic 340, scorer 370, classifier 380, indicator generator 385 and reporting engine 390 are each located on a remote server on the private network 120 or on a public network 110 connected to the private network 120 via interconnects 130. Alternatively, all of these components may be co-located at or near the periphery of the private network 120.

The static analysis logic 320 may receive the network traffic to then extract the objects and related metadata, or may receive the objects and related metadata from the network interface(s) 310 already extracted. The term “object” generally refers to a collection of data, whether in transit (e.g., over a network) or at rest (e.g., stored), often having a logical structure or organization that enables it to be classified for purposes of analysis. The static analysis logic 320 may provide the objects to the indicator scanner 330 to identify if the objects match known indicators of malware. The term “indicator” (or “signature”) designates a set of characteristics and/or behaviors exhibited by one or more malware that may or may not be unique to the malware. Thus, a match of the signature may indicate to some level of probability that an object constitutes malware. In some contexts, those of skill in the art have used the term “signature” as a unique identifier or “fingerprint.” For example, a specific malware or malware family, may be represented by an indicator which is generated, for instance, as a hash of its machine code, and that is a special sub-case for purposes of this disclosure. The indicator scanner 330 may incorporate, in memory (not separately shown), a database of known malware indicators. The database of known malware indicators may be updated by receiving through the network interface(s) 310 from the public network 110 or the private network 120, via network interconnects 130, new indicators of malware. The database of indicators may also be updated by the indicator generator 385.

The heuristics engine 335 determines characteristics of the objects and/or network traffic, such as formatting or patterns of their content, and uses such characteristics to determine a probability of maliciousness. The heuristic engine 335 applies heuristics and/or probability analysis to determine if the objects might contain or constitute malware. Heuristics engine 335 is adapted to analyze certain portions of the network traffic, constituting an object (e.g., the object may include a binary file), to determine whether a portion of the network traffic corresponds to either, for example: (i) a “suspicious” identifier such as either a particular Uniform Resource Locator “URL” that has previously been determined as being associated with known malware, a particular source or destination (IP or MAC) address that has previously been determined as being associated with known malware; or (ii) a known malicious pattern corresponding with malware. The heuristics engine 335 may be adapted to perform comparisons of an object under analysis against one or more pre-stored (e.g., pre-configured and/or predetermined) attack patterns stored in memory (not shown). The heuristics engine 335 may also be adapted to identify deviations in messaging practices set forth in applicable communication protocols (e.g., HTTP, TCP, etc.) exhibited by the traffic packets containing the object, since these deviations are often characteristic of malware. A match of an identifier may indicate, to some level of probability, often well less than 100%, that an object constitutes malware. The identifiers may represent identified characteristics (features) of the potential malware. The heuristics engine 335 may include scoring logic to correlate one or more characteristics of potential malware with a score of maliciousness, the score indicating the level of suspiciousness and/or maliciousness of the object. In one embodiment, when the score is above a first threshold, the heuristic engine 335 may generate an alert that the object is malicious. When the score is greater than a second threshold but lower than the first threshold, the object may be provided to the static analysis logic and/or the dynamic analysis logic for further analysis. When the score is less than the second threshold, the threat detection system may determine no further analysis is needed (e.g., the object is benign).

For dynamic analysis, the static analysis engine 320 may provide the object to the scheduler 350. The scheduler 350 is responsible for provisioning and instantiating a virtual machine to execute the object at a schedule time. In some embodiments, the heuristic module 335 transmits the metadata identifying a destination device to the scheduler 350, which can then provision a virtual machine with software (operating system (OS) and one or more applications) and other components appropriate for execution of the network data (data packets or objects), which in some cases are those associated with the destination device. In other embodiments, the scheduler 350 receives one or more data packets of the network traffic from the network interface(s) 310 and analyzes the one or more data packets to identify the destination device. A virtual machine is executable software that is configured to mimic the performance of a device (e.g., the destination device).

The scheduler 350 can configure the virtual machine to mimic the performance characteristics of a destination device that are pertinent for behavioral monitoring for malware detection. The virtual machine can be provisioned from the store of software profiles 355. In one example, the scheduler 350 configures the characteristics of the virtual machine to mimic only those features of the destination device that are affected by an object to be executed (opened, loaded, and/or executed) and analyzed. Such features of the destination device can include ports that are to receive the network data, select device drivers that are to respond to the network data and any other devices coupled to or contained within the destination device that can respond to the network data. In other embodiments, the dynamic analysis logic 340 may determine a software profile, and then configures one or more virtual machine(s) 360 with the appropriate ports and capabilities to receive and execute the network data based on that software profile. In other examples, the dynamic analysis logic 340 passes the software profile for the network data to the scheduler 350 which either selects from the store of software profiles 355 or configures the virtual machine based on that profile.

The store of software profiles 355 is configured to store virtual machine images. The store of software profiles 355 can be any storage capable of storing software. In one example, the store of software profiles 355 stores a single virtual machine image that can be configured by the scheduler 350 to mimic the performance of any destination device on the private network 120. The store of software profiles 355 can store any number of distinct virtual machine images that can be configured to simulate the performance of any destination devices when processed in one or more virtual machine(s) 360.

The processing of an object may occur within one or more virtual machine(s), which may be provisioned with one or more software profiles. The software profile may be configured in response to configuration information provided by the scheduler 350, information extracted from the metadata associated with the object, and/or a default analysis software profile. Each software profile may include a software application and/or an operating system. Each of the one or more virtual machine(s) 360 may further include one or more monitors (not separately shown), namely software components that are configured to observe, capture and report information regarding run-time behavior of an object under analysis during processing within the virtual machine. The observed and captured run-time behavior information as well as effects on the virtual machine, otherwise known as features, along with related metadata may be provided to a scoring logic 370.

The scoring logic 370 generates a score used in a decision of maliciousness by the classification engine 380. The score may be a probability value (expressed in any of various ways such as, for example, a numerical value or percent) or other indicator (quantitative or qualitative) of security risk or so-called threat level. The determination of the score of the object processed by the malware detection system 300 may be based on a correlation of each of the features identified by the static analysis logic 320 and dynamic analysis logic 340. The features may include characteristics, where characteristics include information about the object captured without requiring execution or “running” of the object. Characteristics may include metadata associated with the object, including, for example, anomalous formatting or structuring of the object. The features may also include behaviors, where behaviors include information about the object and its activities captured during its execution or processing. Behaviors may include, but are not limited to, attempted outbound communications over a network connection or with other processes, patterns of activity or inactivity, and/or attempts to access system resources.

The scoring logic 370 may correlate one or more characteristics and monitored behaviors (features) with a weight of maliciousness. The weight of maliciousness reflects experiential knowledge of the respective features (characteristics or monitored behaviors) and their correlations with those of known malware and benign objects. For example, during processing, the dynamic analysis logic 340 may monitor several behaviors of an object processed in the one or more virtual machine(s) 360, where, during processing, the object (i) executes a program, (ii) the program identifies personally identifiable data (e.g., login information, plain-text stored passwords, credit information), (iii) the program generates and encrypts the data in a new file, (iv) the program executes a network call, and (v) sends the encrypted data via the network connection to a remote server (exfiltrates the data). Each individual event may generate an independent score, weighted by the scoring logic 370, the weight based on experiential knowledge as to the maliciousness of each associated event. The individual scores or a combined score across these events may be provided to the classifying engine 380. Alternatively, in some embodiments, the generation of a combined score may be performed by the classifying engine 380, or the scoring logic 370 and classification engine 380 may be combined into a single engine.

The classifying engine 380 may be configured to use the scoring information provided by scoring logic 370 to classify the object as malicious, suspicious, or benign. In one embodiment, when the score is above a first threshold, the heuristic engine 335 may generate an alert that the object is malicious. When the score is greater than a second threshold but lower than the first threshold, the object may be provided for further analysis to the static analysis logic and/or the dynamic analysis logic for further analysis. When the score is less than the second threshold, the classifying engine 380 may determine no further analysis is needed (e.g., the object is benign). The threshold of maliciousness may be fixed, modified by as security administrator, and/or modified based on network conditions (for example, if a network is experiencing anomalous network conditions, if many other clients of a similar type are under confirmed attack, etc.). The classifying engine 380 may be configured to classify the object based on the characteristics identified by the static analysis logic 320 and/or the behaviors (expected and unexpected/anomalous) monitored by the dynamic analysis logic 340. In some embodiments, the classifying engine 380 may use only the correlation information provided by the scoring logic 370. That is, a determination of whether the monitored behaviors represent expected or unexpected (anomalous) behaviors is rendered by correlating the monitored behaviors against behaviors of known malware. Results of the static analysis may also be used in the correlation and classification, e.g., by being combined with the results of the dynamic analysis to yield a combined score. In an embodiment, further static analysis and/or dynamic analysis may be performed at the MDS 300 based on the results of correlation and classification engines.

In some embodiments, the classifying engine 380 may provide objects classified as malicious to the indicator generator 385, which may then generate indicators associated with these malicious objects. Additionally, the indicator generator 385 may receive non-malicious objects to generate a suitable indicator associated with non-maliciousness. In some embodiments, the indicators may be “fingerprint” type signatures, formed as a hash of the object. Alternatively, or in addition, the indicators may include identification of observed features, including characteristics and behaviors. The indicators thus generated may be provided to the security logic engine 400 for further enhancement (e.g., with additional indication of features) using results provided by endpoint devices 200. The classifying engine 380 may alternatively bypass the indicator generator 385 if it determines that the analyzed object is not malicious. The indicators may be provided to the indicator scanner 330 for use in inspecting (by scanning) subsequently received objects. In some embodiments, the indicator generator 385 may also distribute the indicators to the endpoint devices 200 and/or the security logic engine 400.

If the malware detection system classifies the object as malicious based on a static analysis results and/or dynamic analysis results, the reporting engine 390 may signal to a network or security administrator for action by an appropriate alert. In an embodiment, the reporting engine 390 may report the indicators (“signatures”) of detected behaviors of a process/object as indicative of malware and organize those indicators as reports for distribution to the endpoint devices 200.

As noted previously, the reporting logic 390 may be configured to generate an alert for transmission external to the malware detection system 300 (e.g., to one or more other endpoint devices 200, to the security logic engine 400, and/or to a central manager). The reporting logic 390 is configured to provide reports via the network interface(s) 310. The security logic engine 400, when external to the MDS 300, e.g., may be configured to perform a management function or a separate management system may be provided, depending on the embodiment, e.g., to distribute the reports to other MDS within the private network, as well as to nodes within a malware detection services and/or equipment supplier network (e.g., supplier cloud infrastructure) for verification of the indicators and subsequent distribution to other malware detection system and/or among other customer networks. Illustratively, the reports distributed by the management function or system may include the entire or portions of the original indicator reports provided by the MDS 300, or may include new versions that are derived from the original reports.

Security Logic Engine

As shown in FIG. 4, an embodiment of the security logic engine 400 includes a network interface(s) 410, a formatting logic 420, a correlation engine 430, a scoring logic 440, a classification engine 450, a labeler 460, a signature matcher 470, a risk analyzer 480, and a reporting engine 490. Generally speaking, the security logic engine 400 may be implemented as one or more network-connected electronic devices, which include(s) physical hardware comprising hardware processor(s), network interface(s), a memory, a system interconnect, an optional user interface, a system interconnect, arranged and organized as shown in FIG. 2. Each of the logic and engine components (including those just listed above) of the security logic engine 400 may be implemented as computer programs or modules executed by one or more processors and stored in memory.

The network interface(s) 410 can be coupled to a network such as private network 120 (FIG. 1) via the network interconnects 130. The network interface(s) 410 may support communication over a conventional link, such as over an Ethernet connection, USB connection, a FireWire connection, a serial connection, a parallel connection, or an ATA connection. The communication network interface(s) 410 may also support wireless communication (e.g., 802.11 a/b/g/n or wireless USB). It will be apparent to those skilled in the art that the communication network interface(s) 410 can support many wired and wireless standards. Where the SLE 400 is implemented within the MDS 300 and thus availed of network interface(s) 310, the network interface(s) 410 may be eliminated or disabled.

In some embodiments, a formatting logic 420 receives communication data from the network interface(s) 410 (or 310 of FIG. 3) and converts the data into a standardized format to be processed by other modules in the security logic engine 400 if not already in a suitable format. In general, the formatting logic 420 obtains data in disparate formats, which may often be device specific (e.g., an intermediary network device 140, malware detection system 300, endpoint device 200, etc.) or application version specific (e.g., endpoint device agent 250 version 1.0 may provide an output in a certain format and endpoint device agent version 2.0 may provide an output in a different format), and transforms the data into a readily consumable, common format. For example, the formatting logic 420 may transform data associated with a first endpoint device agent 250 to that of a common data format (e.g., JSON, etc.) such that any ensuing analysis and correlation with other devices may be provided using a common data format.

The correlation engine 430 may correlate the features received by the security logic engine 400 from an endpoint device 200 and the malware detection system 300 with known behaviors and characteristics of benign and malicious objects. Additionally, the correlation engine 430 may correlate the features received from the endpoint device 200 with those received from the malware detection system 300 to verify the determination of maliciousness obtained by the malware detection system 300 or determine the extent to which the features from these two vantage points (network periphery and endpoint) correlate with one another. The correlations just described in the preceding two sentences can be performed separately or in the same operation depending on the implementation, and in other embodiments one or the other may be eliminated altogether.

The results of the correlation performed by the correlation engine 430 may be provided to a scorer 440. The scorer 440 may generate a score based on each correlation of an observed feature with known behaviors and characteristics of benign and malicious objects. The classification engine 450 may utilize the scores generated by the scorer 440 to classify the object as malicious if it exceeds a threshold. The threshold may be fixed or dynamic. The maliciousness threshold may be “factory-set,” “user-set,” and/or modified based on information received via a network interface(s) 410.

The correlation engine 430 may be configured, depending on the embodiment, (a) to verify a classification of maliciousness made by the endpoint 200, (b) to provide greater or lesser confidence that an object processed by the endpoint 200 should be classified as malware, and/or (c) to determine whether the malware detection system 300 has received and is processing malware, and if so, whether the malware is the same as that detected by the endpoint 200. The first of these involves the correlation engine 430 to correlate at least the results of the endpoint 200 with those of the malware detection system 300. The last of these involves the correlation engine 430 correlating the features reported by the malware detection system 300 with those of known malware, and compare the correlation results with those obtained by the endpoint 200.

For example, the correlation engine 430 may receive, over a communication network via network interface(s) 410, (i) a feature set (features including behaviors and, in some embodiments, characteristics) monitored by the endpoint device agent 250, and (ii) a feature set (features including behaviors and, in some embodiments, characteristics) associated with an object classified by the malware detection system as malware, and in some embodiments, the associated score or threat level determined by the MDS. The correlation engine 430 may correlate the feature sets received from the endpoint device 200 and the MDS 300 to determine whether the endpoint 200 observed the same or similar features to those monitored in the MDS 300 on which its classification decision was based, and may also correlate those feature sets with known features exhibited by known malware and/or malware families. In so doing, the correlation engine 430 may apply correlation rules to determine whether the feature sets separately (or those common features of the feature sets) indicate or verify the object as malware. The correlation rules may define, among other things, patterns (such as sequences) of known malicious behaviors, and, in some embodiments, also patterns of known benign behaviors. For example, in looking at patterns, a behavior may be detected that appears benign, but when examined with other behaviors, may be indicative of malicious activity.

The scorer 440 generates a risk level or numerical score used in a decision of maliciousness by the classification engine 450. The score may be a probability value (expressed in any of various ways such as, for example, a numerical value or percent) or other indicator (quantitative or qualitative) of security risk or threat level. The determination of the risk level of the object processed by the MDS 300 and observed by the endpoint device 200 may be based on monitored events used by the correlation engine 430, including, for example, (i) the location from where the object originated (e.g., a known website compared to an unknown website), (ii) the processed object spawned a new process, and/or (iii) actions taken by received objects during processing (e.g., executable code contained in objects attempts to execute a callback). An object with an associated score (value) above a first threshold may indicate a suspicious object, i.e., an object with a certain probability of being malicious, and above a second, higher threshold may indicate that object should be classified as malware, i.e., an object with a high probability of being malicious. In some embodiments, the scorer 440 may increase or decrease a score provided by the MDS 300 or may generate its own score based on all the available features of the feature sets. For example, if the results of the correlation of monitored behaviors from the endpoint device 200 and the MPS 300 and, in some embodiments, features associated with known malware, reveal a level of similarity above a first predetermined threshold (e.g., 60% or 70%), the scorer 440 may so indicate in its score. The security logic engine 400 may classify the object as malware in response to the score generated by the scorer 440.

Accordingly, the classification engine 450 may be configured to use the correlation information provided by correlation engine 430 and the score provided by a scorer 440 to render a decision as to whether the object is malicious. Illustratively, the classification engine 450 may be configured to classify the correlation information, including monitored behaviors and characteristics, of the object relative to those of known malware and benign content. If a first probability of attack (score) is received by the security logic engine 400 from a malware detection system 300 and differs by a threshold amount or falls beyond a comparison “range” from the probability of attack (score) as determined by the classification engine 450, the security logic engine 400 may generate a second classification (the classification generated by the classification engine 450), and provide the second classification to the malware detection system 300 and report the second classification in an alert. The threshold or comparison range may be fixed, and/or based on a percentage of the initial classification by the malware detection system 300.

In an embodiment, the security logic engine 400 may include a labeler 460 configured to add names of malware or malware families to indicators (signatures) of malware. The labeler 460 may define a new malware family or add the identified malware to the malware family bearing the greatest similarity to the identified malware. The similarity may be based on a correlation, conducted by the correlation engine 430 or the labeler 460, of the identified malware behaviors with a database (not shown) of known malware family entries and associated behaviors. The database entry for the known malware family associated with the newly detected malware may be updated with any new features detected for the malicious object. Alternatively, the association of a malware family may be implemented in a separate module. The malware detection system 300 may update the indicator scanner 330 using the enhanced indicators generated by the labeler 460. These indicators may be used internally by the indicator scanner 470 of the security logic engine 400 or distributed externally as part of indicator reports to the malware detection system (s) 300 or endpoint device(s) 200.

The indicator scanner 470 receives, authenticates, and stores malware indicators, and scans results received from the malware detection system 300 and results received from an endpoint device 200 to determine, when they match, that the object under analysis is malicious. The indicator scanner 470 may also generate enhanced indicators based on the additional information received from the endpoint device 200.

The risk analyzer 480 determines the risk of harm to private network 120 from a verified malicious object based on the results provided by the classification engine 450 and labeler 460 and the indicator scanner 470. The risk analyzer 480 may base the risk of harm on information retrieved from a database regarding named malware or malware families. More specifically, the risk analyzer 480 may receive information about the object from the classification engine 450 and/or the labeler 460, which may also provide the observed behaviors from an endpoint device 200 and a malware detection system 300 as well as a malware family name and/or identified malware name. The risk analyzer 480 may also retrieve information from the network or be provided with information about network device properties (e.g., network location, connected users, operating system version, etc.) for use in its risk assessment. The risk analyzer 480 may also receive a classification of the malware from the classification engine 450 or the signature matcher 470. The risk analyzer 480 determines a risk to the private network 120 using experiential knowledge to correlate the information about the malicious object with the information about the network device properties. The correlation results in a risk profile for each endpoint device, which may be provided to a network administrator.

The risk analyzer 480 may identify endpoint device(s) 200 that may be affected by the cyber-attack involving the verified malicious object. The risk analyzer 480 may utilize the identified features and metadata of a verified malicious object to determine if the system configuration (where a system configuration may be characterized by its hardware and software profiles) of each endpoint device in the private network 120 is vulnerable to the attack. To determine the risk posed by the verified malicious object to each endpoint device 200, the risk analyzer 480 may correlate each feature and its metadata of the object (e.g., software profile running during processing of the object during which malicious behavior was observed) with system configuration attributes of the endpoints on the network. If an endpoint device system configuration correlates with the features and metadata of the verified malware, the risk analyzer 480 identifies the endpoint device as at risk to attack.

In some embodiments, the risk analyzer 480 may communicate to a malware detection system 300 that further analysis of the verified malicious object is necessary if the risk analyzer 480 cannot determine if the verified malicious object will behave maliciously when processed by endpoint device system configurations on the private network. The malware detection system 300 may conduct the further analyses with software profiles and other system characteristics as available to the malware detection system for use with its virtual machines.

The risk analyzer 480 may issue alerts to particular network devices (e.g., endpoint devices, network storage servers being accessed by an endpoint device 200 with a verified malicious object present) to restrict access from network devices found to be correlated with a high risk and/or may issue alerts to a network or security administrator via the reporting engine 490.

The reporting engine 490 is adapted to receive information from the signature matcher 470 and the risk analyzer 480 to generate alerts that identify to a user of an endpoint device, network administrator or an expert network analyst the likelihood of verified network cyber-attack. Other additional information regarding the verified malware may optionally be included in the alerts. For example, additional reported information may contain, in part, typical behaviors associated with the malware, particular classifications of endpoint devices or users that may be targeted, and/or the priority for mitigation of the malware's effects. Additionally, a user of an endpoint device that was to receive the objects and/or a network administer may be alerted to the results of the processing via alert generated by a reporting engine 490. The reporting engine 490 may also provide connected malware detection systems and endpoint devices 200 with updated information regarding malicious attacks and their correlation with particular behaviors identified by the security logic engine 400. Where the security logic engine is a component of the MDS 300, the reporting engine 490 may be eliminated or combined with reporting engine 390.

FIG. 5 represents an exemplary flowchart of a computerized method 700 for operating a cyber-attack detection system. The method 700 starts at step 705 and proceeds to step 710 wherein an endpoint (e.g., a user-operated laptop) begins processing an object. The object may be processed at an endpoint in response to a user action (e.g., in the case where the endpoint is, for example, a laptop, the opening of a document file initiated by a user, in the case where endpoint is, for example, a server, the processing of data on a server in response to a user action, etc.) or a command to process the object, for example, in the case where the endpoint is a server, and as a collateral result, to identify features of the object exhibited during processing to identify a cyber-attack. During step 715 features of the object are monitored and collected while the object is processed by the agent within the endpoint. For example, an object being processed as an executable may exhibit behaviors that may be identified by the endpoint, such as accessing memory outside of its allocated range, injecting data into another running and/or accessing sensitive system files. The features are monitored through the endpoint monitoring logic 255. In step 720 the endpoint determines, through correlation of detected features with the features of known malicious and benign objects, if the object is suspicious using the endpoint classifier 260. Suspicious features may be features that are associated with potential or actual cyber-attacks as well as anomalous (i.e., represent unexpected, undesired, or unwanted) behaviors. If the endpoint determines that no monitored behaviors of the object are suspicious, the method ends at step 755.

If the endpoint identifies features of the object that may be indicative of malware in step 720, the object analyzed may be suspicious. Features may be determined to be indicative of malware, and thus suspicious by the endpoint employing heuristics, black lists (or white lists), or by correlation with features of known malicious and benign objects based on experiential knowledge and machine learning. If the endpoint determines the object is suspicious, further analysis by a malware detection system may be triggered or otherwise initiated, for example, in response to a request from an endpoint. In step 725 the malware detection system receives information related to the object for further processing. The information received by the malware detection system may include the object itself and/or information related to the object (e.g., an identifier associated with the object—the identifier may be used to retrieve a copy of the object stored by a network traffic store such as a network storage system or facility and/or a network periphery device which may have stored the object). In some embodiments the MDS may receive the information related to the object directly from the endpoint which first processed the object, or in other embodiments through an intermediary network device (for example, a network device incorporating a security logic engine). The MDS, in response to the request for analysis of the object may determine if the suspicious object has already been classified as malicious or benign. In some embodiments, to determine if the suspicious object has already been classified, the MDS may access a local store of classifications associated with malicious and/or benign objects previously scanned or submit a request to a network based store of prior classifications. In some embodiments, if the MDS does not have access to the suspicious object for analysis (e.g., the suspicious object was received by the endpoint while it was not connected to a network protected by the MDS and therefore was not stored there), the MDS may request the object for analysis from one or more endpoints on the network (e.g., from the endpoint requesting the analysis of the suspicious object) or from a network traffic store connected to the MDS which may preserve (for a limited period of time) objects transmitted throughout the network. In some embodiments the endpoint may provide additional information about the context of the processing of the object by the endpoint, to the MDS, by providing information about the software profile of the endpoint (such as the operating system and applications available on the endpoint during processing of the object) and/or features detected during processing by the endpoint.

In step 730 the MDS may conduct an analysis of the suspicious object. In some embodiments, the MDS may conduct an analysis of the suspicious object using a static and/or dynamic analysis. The static analysis may, in part, include an indicator scanner 330 and/or a heuristics engine 335 which may utilize statically identified characteristics of the suspicious object to determine if the object is malicious. The dynamic analysis of the suspicious object may include processing the suspicious object in a dynamic analysis logic 340 (e.g., employing a virtual machine of the malware detection system 360) configured with monitoring mechanisms to identify behaviors associated with the executing the object by the virtual machine. In some embodiments the virtual machine may be configured with an operating system, and a set of one or more applications, which may be collectively referred to as a software profile. The software profile of the virtual machine may be consistent (identical and/or similar) with the software profile running on the endpoint having reported the suspicious object, with respect to the operating system and affected applications, or at least in so far as the object requiring processing in the virtual machine requires or expects to find the software running in its run-time environment. In some embodiments, the results (characteristics and behaviors) generated by the analysis of the MDS (including the characteristics identified by the static analysis engine and the behaviors monitored during processing the object in the dynamic analysis logic may be provided, as features, to an MDS correlation engine for determination of maliciousness. In still other embodiments these features detected by the MDS may be provided to a security logic engine for correlation and determination of maliciousness.

In step 735, the security logic engine 400 receives the detected features associated with the suspicious object from the MDS 300 and the endpoint 200. In some embodiments the SLE may be a component of the MDS whereby the SLE shall receive the detected features of the object processed via the reporting engine 390. The SLE may correlate the features received and combine the results of the analysis performed by the MDS 300 in step 730 with the monitored features from the endpoint device 200 in step 715. The combination of the features received by the SLE may be used to generate further correlations of those features with the features of known malicious and benign objects as determined from experiential knowledge and machine learning. If further correlations with features known (labeled) objects exceed a correlation threshold, as described herein, the SLE 400 in step 45 may identify a cyber-attack. If the determination of a cyber-attack cannot be made, the SLE 400 may await further monitored features to be received from the endpoint device 200 or end the analysis.

If the system determines the object is benign, proceed to step 755 where the process ends. If a determination of maliciousness is made, in step 755, the security event analyzing logic 400 may report the determination of a cyber-attack to a security administrator for remediation and/or mitigation.

In some embodiments, the SLE may poll (communicate with) another endpoint to determine whether the other endpoint has processed the same object or whether the other endpoint has detected similar behaviors.

FIG. 6 depicts a flow chart for another exemplary computerized method for operating a security logic engine 400 of a cyber-attack detection system. The method 800 starts at step 805 and proceeds to step 810 where the security logic engine 400 receives features, from an endpoint 200, detected during processing of an object by the endpoint. The processing of the object by the endpoint may be user-initiated (e.g., an object may be a document file, and features are collected by the endpoint when the user opens the file and it is processed by a document viewing application or word processing application).

In step 812 the security logic engine, in response to receiving an indication of suspiciousness from the processing of the object by the endpoint, triggers the processing of the object by a malware detection system. The security logic engine may receive, from the endpoint (directly via a communication interface or indirectly via one or more intermediary devices such as a USB flash drive, network connected appliance, etc.) the object or an object identifier that may be used to retrieve the object. The indication of suspiciousness received by the security logic engine from the endpoint may result from applying a heuristic, at the endpoint, to features detected during processing of the object by the endpoint. In some embodiments, the indication of suspiciousness may result from a correlation of the features detected, during processing by the endpoint, with known malicious or benign objects. The SLE may receive from the endpoint an indication of suspiciousness resulting from a heuristic or correlation of the features detected, employed at the endpoint, and an identifier of the object or the object itself. The SLE may send a message to trigger, in step 815, in response to receiving the indication of suspiciousness, a malware detection system to process the object in a monitored runtime environment such as a virtual machine. The security logic engine may provide the object (or the object identifier) to the static analysis logic 320 and to dynamic analysis logic 340 of the malware detection system so that the object may be processed.

The method continues with the SLE 400 receiving features from the malware detection system 300 in step 815 by the static analysis logic 320 and dynamic analysis logic 340. In some embodiments, the processing and classification conducted by the malware detection system 300 may be directed by the SLE in response to receiving a suspicious object requiring further analysis and/or a security administrator. The malware detection system 300 processes the received network traffic using at least the static analysis logic 320 and/or dynamic analysis logic 340.

In step 820 the security logic engine 400 combines the features and/or information associated with processing the object received from the one or more endpoints 200 and the malware detection system 300. In some embodiments, where the security logic engine is a component of the malware detection system, the security logic engine may receive the information associated with processing the object through the reporting engine 390. In some embodiments, the malware detection system 300 may provide the security logic engine 400 a determination of maliciousness. The security logic engine may verify the malware detection system determination of maliciousness by correlating the received features (the features received from the one more endpoints 200 and the malware detection system 300), with the features of known malicious and benign objects to determine if the object is malicious. If the malware detection system determination of maliciousness and the security logic engine determination of maliciousness correspond, the determination of maliciousness is verified. If determination of maliciousness from the malware detection system 300 and the security logic engine 400 do not correspond, the determination of maliciousness by the security logic engine, based on features received from processing the object at least one endpoint and the malware detection system, supersedes the malware detection system determination of maliciousness and is not verified.

At step 825 the security logic engine 400 determines if the object is malicious based on the information and features related to processing the object that are received. If the object is determined to be malicious the method proceeds to step 840 where security alerts for an administrator are generated and issued by the system. The security alert generated may provide the object, the object identifier, a description of the threat posed to the network, a tracing of the cyber-attack in the network, and an identification of affected endpoints and/or endpoints which may be at risk to the cyber-attack. If the object is determined to not be malicious, the method may terminate in step 845. If the object is determined to be malicious, the method may continue to step 835, wherein the security logic engine 400 identifies endpoints on the network susceptible to the object. For example, the security logic engine 400 may identify endpoints susceptible to the object by identifying endpoints with a software profile similar to the first endpoint and/or the software profile used to configure the malware detection system 300 virtual machine (in step 815).

In some embodiments the method may continue to step 830 where the security logic engine 400 receives additional features detected while processing the same (or a similar) object by a second endpoint. A signature (e.g., a hash) of the object may be used to identify whether the second endpoint is processing the same object. The additional features from the second endpoint may be used by the security logic engine to modify the determination of maliciousness of the object. The additional features received from the second endpoint by the security logic engine in step 830 may be combined with the previously received features for correlation and classification in step 820. The combined features may, in step 825, the object(s) to be malicious. If the determination of maliciousness is in accord with the previous determination of maliciousness, the latter is verified. If the security logic engine modifies the determination of maliciousness for the object in step 820, the SLE may generate and issue a modified report of the cyber-attack. The procedure may proceed, if the object is determined non-malicious to step 845, to step 830 if additional features from the same or yet another endpoint are received by the security logic engine, or step 835 if the object is determined to be malicious and no additional features are received.

In some embodiments the additional features received from the second endpoint in step 830 may result in a feature correlation, with known malicious objects, so as to classify the object as malicious based on certain characteristics. For example, the vulnerability to cyber-attack caused by a malicious object t may only affect endpoints using both Windows 8.1 and Firefox 41.0 and 45.0. In some embodiments the security logic engine 400 may generate alerts for endpoints with the known characteristics of vulnerability. If an alert was generated and issued before the additional information from a second (or more) endpoint, the SLE may modify the existing alert and/or generate and issue another alert.

In still further embodiments, the additional features received from the second endpoint by security logic engine in step 830 may indicate that the object contains polymorphic code (i.e. code that allows the malicious program to mutate—have a different code signature—but maintain the original or core malicious functions and purpose). Polymorphic code may be identified in step 830 if objects, received from a plurality of endpoints, have similar behaviors but different code signatures. The similar behaviors may be identified by the security logic engine identifying a correlation between the objects received from the plurality of endpoints. The additional information related to the object received from another endpoint in step 830 may be used by the security logic engine to determine if the object contains polymorphic code by employing the correlation engine 430 of the security logic engine to determine if the features received from the plurality of endpoints in response to processing object correlate. The identification of polymorphic code may cause the scorer 440 to increase the maliciousness score of the object. The classification of the object in step 825 as malicious may lead the procedure to step 830 or to step 845 if determined to be not malicious.

If the security logic engine does not receive additional features (i.e. step 830 does not occur) and the object is not determined to be malicious, the process proceeds to step 845 where it ends. Conversely, if the security logic engine does not receive additional features and the object is determined to be malicious, the security logic engine generates and issues a report to a security administrator detailing the cyber-attack in step 840 and the procedure proceeds to step 845 where it ends. In some embodiments, the security logic engine may also send messages to endpoints affected by the malicious object, or to endpoints found by the SLE to be at risk of cyber-attack by the malicious object, to notify the endpoint or, via screen display or other alert, of the attack and in some embodiments, to block and/or prevent processing of the object by the endpoint.

In some embodiments, the security logic engine 400 may direct a malware detection system 300 to analyze an object, known to be malicious, to determine the risk posed to other endpoints on the network. For example, the security logic engine may make a determination that an object received from a first endpoint is malicious based on the features received in response to processing the object by the endpoint and a malware detection system. To determine if the object is malicious to other computers on the network the security logic engine may direct the malware detection system to process the object in a virtual machine configured with a software profile similar to at least one other endpoint on the network and collect features related to maliciousness. The security logic engine may receive these features to determine if the object is malicious to endpoints with a same or similar software profile based on the directed processing of the object in the malware detection system.

In yet another embodiment, the security logic engine 400 may direct a second endpoint on the network to collect and return monitored features resulting from processing the object and coordinate with the security logic engine to determine if the object is malicious to the second endpoint. The security logic engine may identify the second endpoint based on a risk to the network as identified by the SLE. For example, if a security logic engine receives features from a first endpoint with a first software profile and the security logic engine determines the object is malicious in coordination with a malware detection system, the security logic engine may identify a set of endpoints with a different software profile and direct at least one endpoint of the set of endpoints to return monitored features resulting from processing the object to the security logic engine for analysis. The security logic engine may combine the features received and correlate the results to classify whether the object is malicious to the set of endpoints. By directing the analysis of the object by at least one of the set of endpoints by the security logic engine, the security logic engine may determine if the object represents a cyberattack risk to the set of endpoints.

As shown in FIG. 7, an embodiment of the system includes a plurality of endpoints 200, a malware detection system 300, and a security logic engine 400. The security logic engine 400 may be implemented as a component of the malware detection system 300 (as shown in FIG. 7) or as a separate component. As a separate component the security logic engine 400 may be communicatively coupled with the endpoint 200 and/or the malware detection system 300 using communication interfaces, e.g., a network interface (such as over an Ethernet connection, USB connection, etc.). The endpoint 200 initiates the interoperative malware detection system (MDS) by identifying a suspicious object.

The endpoint 200 may identify a suspicious object and communicate at least an identifier of the suspicious object and/or the suspicious object to the MDS. The endpoint 200 may identify the suspicious object by monitoring the processing of the object. The object may be processed in response to user interaction (e.g., opening a document file from the internet, where the document is the object) or in response to an automated process (e.g., the endpoint receives an object from another system, via a network connection, for monitoring during processing). The endpoint may detect features related to the processing of the object at the endpoint. The endpoint may correlate these features with known features of malicious objects and classify the object as suspicious. If the endpoint determines the object is suspicious it may communicate at least the suspicious object or suspicious object identifier to an SLE or MDS. The endpoint may optionally also communicate a set of detected features and/or additional information about the endpoint (e.g., software profile information, such as the operating system, applications available on the endpoint, and/or the programs responsible for processing the object).

The suspicious object or suspicious object identifier may be communicated directly to the MDS or via a security logic engine (SLE). If the MDS and/or SLE receive a suspicious object identifier via a communication interface, each may retrieve the suspicious object from an object store (e.g., a network traffic storage system), a local store of objects, and/or by requesting the endpoint having communicated the suspicious object identifier return the object to each respective system.

The malware detection system 300 may receive the suspicious object (or suspicious object identifier by which the MDS may retrieve the suspicious object as previously described herein) for processing and classification as malicious or benign. In some embodiments, the MDS may receive the suspicious object through the integrated SLE component, or alternatively the MDS may retrieve the suspicious object directly. The MDS may process the suspicious object, to detect features, in at least a static analysis logic 320 or a dynamic analysis logic 340. The static analysis logic 320 may identify features of the object that may provide sufficient features for a correlation of the MDS to classify the suspicious object as malicious. In some embodiments the features detected by the static analysis logic may be used by the dynamic analysis logic 340 to guide feature detection during processing.

The dynamic analysis logic 340 may configure one or more virtual machine(s) (see FIG. 3) of the MDS for processing of the suspicious object. The virtual machine(s) may be configured with behavioral monitors to identify features resulting from the processing of the suspicious object in the virtual machine(s). In some embodiments, the dynamic analysis logic 340 of the MDS may receive software profile information related to the endpoint (e.g., endpoint operating system, applications available, etc.) that detected the suspicious object. The received software profile information may be used to configure the dynamic analysis logic 340 virtual machine(s). In some embodiments the dynamic analysis logic 340 may determine the configuration to be used by a virtual machine by communicating directly with the endpoint to identify the requirements of the software profile, and/or by querying an endpoint management system, storing information about endpoints on the network, to identify an appropriate software profile.

The features detected by the MDS by processing the suspicious object using the static analysis logic 320 and/or the dynamic analysis logic 340, may be correlated with features of known malicious objects to determine if the suspicious object is malicious or benign. The features detected by the MDS may be communicated to the SLE 400 and combined with information received by the SLE from the endpoint. The SLE may correlate the received information from the endpoint and the MDS to classify the suspicious object as malicious or benign. The classification done by the SLE, if consistent with the determination by the MDS, verifies the MDS determination of maliciousness.

Similarly, in an embodiment, the SLE may direct the MDS to process an object, in the virtual machine(s) configured with a software profile of an unaffected endpoint. The SLE may provide the object directly to the MDS or provide an object identifier by which the MDS may retrieve the object via the network. During processing of the object by the MDS in the virtual machine(s) the MDS may detect features associated with processing of the object. The features detected by the MDS may be provided by the MDS to the SLE for correlation and classification in combination with any features previously received by the SLE related to that object. By classifying the object with the features received from the MDS directed to process the object, the SLE may identify a set of endpoints on the network, with a given software profile, that may be vulnerable to the object.

In further embodiments, the SLE may identify polymorphic code by combining the features detected by at least two processing locations (e.g., MDS and endpoint, or between two endpoints, etc.). The SLE may determine that the features detected by the plurality of processing locations indicate identical behaviors that relate to maliciousness, however the objects have different code signatures. The different code signatures but identical behaviors may indicate the existence of malicious polymorphic code. If the object in which the polymorphic code is identified is found to be malicious by the SLE, the SLE may generate and issue an alert to a security administrator and/or to affected endpoints providing information about the malicious object.

In some embodiments, the SLE may determine the progress of a cyber-attack through the network. The SLE, having identified a cyber-attack based on features collected while processing an object by at least one endpoint and a malware detection system, may receive further features and/or information related to the cyber-attack (related to processing the object) from other endpoints on the network. In some embodiments this may include information from known affected endpoints related to communication with other endpoints, this communication may represent transmission of malicious content, related to the cyber-attack, to additional endpoints. By combining and analyzing the received information related to the cyber-attack, from the endpoints, the SLE may determine the scope of the attack and trace the attack through the network. The SLE may generate reports tracing the cyber-attack to be used in mitigation and repair of the network.

In some embodiments, if the SLE determines the suspicious object is malicious, the SLE may determine that a cyber-attack is underway and initiate mitigation techniques. The mitigation techniques may be initiated automatically by the SLE or in response to commands by a security administrator. A mitigation technique may include issuing an alert to affected endpoints regarding the malicious object and/or issuing an alert to endpoints with a similar software profile (and thus similarly vulnerable) as the affected endpoint. Another mitigation technique may include generating and distributing information to block and/or stop the malicious object from being processed by another endpoint on the network.

The foregoing description has been directed to specific embodiments. It will be apparent, however, that other variations and modifications may be made to the described embodiments, with the attainment of some or all of their advantages. For instance, it is expressly contemplated that the components and/or elements described herein can be implemented as software encoded on a tangible (non-transitory) computer-readable medium (e.g., disks, electronic memory, and/or CDs) having program instructions executing on a computer, hardware, firmware, or a combination thereof. Moreover, the embodiments or aspects thereof can be implemented in hardware, firmware, software, or a combination thereof. In the foregoing description, for example, in certain situations, terms such as “engine,” “component” and “logic” are representative of hardware, firmware and/or software that is configured to perform one or more functions. As hardware, engine (or component/logic) may include circuitry having data processing or storage functionality. Examples of such circuitry may include, but is not limited or restricted to a microprocessor, one or more processor cores, a programmable gate array, a microcontroller, an application specific integrated circuit, semiconductor memory, or combinatorial logic. Accordingly this description is to be taken only by way of example and not to otherwise limit the scope of the embodiments herein. Therefore, it is the object of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the invention. 

What is claimed is:
 1. A computerized method to determine a potential cyber-attack being conducted on a network, comprising: receiving, by a security logic engine, a first set of events associated with a first object and a second set of events associated with a second object that is different from the first set of events, wherein (1) the first set of events is provided from a first endpoint of a plurality of endpoints within the network and each event of the first set of events corresponds to information related to a behavior that is monitored during processing of the first object by the first endpoint, and (2) the second set of events is provided from an analysis system different than any of the plurality of endpoints and each event of the second set of events corresponds to information related to a behavior that is monitored during processing of the second object by the analysis system; generating, by the security logic engine, a first determination of a likelihood of the cyber-attack being conducted on the network by at least correlating the first set of events and the second set of events with at least events associated with at least known malicious objects; generating, by the security logic engine, a first alert to report the first determination of the likelihood of the cyber-attack occurring at the first endpoint; and determining a scope of the cyber-attack by at least identifying whether one or more endpoints of the plurality of endpoints having a different software profile than the first endpoint are vulnerable to the cyber-attack based on (i) obtaining the different software profile from the one or more endpoints, (ii) providing the different software profile to the analysis system, (iii) receiving a third set of events corresponding to analysis results representing a processing of the first object by an endpoint with the different software profile, (iv) performing a second determination to confirm or modify the likelihood of the cyber-attack being conducted on the network based on the analysis results, and (v) generating a second alert to report the second determination to enlarge the scope of the cyber-attack in response to confirming the cyber-attack being directed to multiple of different endpoints within the network.
 2. The computerized method of claim 1, wherein the analysis system is a malware detection system.
 3. The computerized method of claim 2, wherein the second set of events comprise behaviors exhibited during processing of the second object in a first virtual machine of the malware detection system.
 4. The computerized method of claim 2, wherein the security logic engine is a component of the malware detection system.
 5. The computerized method of claim 2, wherein the identifying whether any endpoint of the plurality of endpoints is vulnerable further comprises: identifying a second software profile being the different software profile; and requesting the malware detection system to conduct the further analysis by at least configuring a virtual machine of the malware detection system with the second software profile.
 6. The computerized method of claim 1, wherein the first alert includes one or more indicators associated with the first set of events or the second set of events that are indicative of malware.
 7. The computerized method of claim 1, wherein the generating of the first determination further comprises correlating the first set of events and the second set of events with at least events associated with a corpus of known benign objects and the known malicious objects.
 8. The computerized method of claim 1, wherein the second alert generated by the security logic engine further comprises information to prevent the processing of a third object, identical in functionality to the first object, by each of the plurality of endpoints.
 9. The computerized method of claim 1, wherein the determining the scope of the cyber-attack by at least identifying whether an endpoint of the plurality of endpoints having a different software profile than the first endpoint invulnerable to the cyber-attack further comprises: associating, by the security logic engine, each endpoint of the plurality of endpoints, communicatively coupled with the security logic engine, with a software profile; and determining, by the security logic engine, a vulnerability of each endpoint of the plurality of endpoints in response to correlating between a vulnerable software profile and the software profiles of each endpoint.
 10. The computerized method of claim 1, wherein the first object is identical in operation to the second object.
 11. A non-transitory computer-readable medium including software that, upon execution within a security logic engine, conducts analyses to determine whether an object is malicious representative of a cyber-attack is being conducted on a network, comprising: receiving a first set of events associated with a first object and a second set of events associated with a second object that is different from the first set of events, wherein (1) the first set of events is provided from a first endpoint of a plurality of endpoints within a network and each event of the first set of events corresponds to information related to a behavior monitored during processing of the first object by the first endpoint, and (2) the second set of events is provided from an analysis system different from an endpoint of the plurality of endpoints and each event of the second set of events corresponds to information related to a behavior monitored during processing of the second object by the analysis system; generating, by the security logic engine, a first determination of a likelihood of the cyber-attack being conducted on the network by at least correlating the first set of events and the second set of events with at least events associated with at least known malicious objects; generating a first alert to report the first determination of the likelihood of the cyber-attack occurring at the first endpoint; and determining a scope of the cyber-attack by at least identifying whether one or more endpoints of the plurality of endpoints having a different software profile than the first endpoint are vulnerable to the cyber-attack based on (i) obtaining the different software profile from the one or more endpoints, (ii) providing the different software profile to the analysis system, (iii) receiving a third set of events corresponding to analysis results representing a processing of the first object by an endpoint with the different software profile, (iv) performing a second determination to confirm or modify the likelihood of the cyber-attack being conducted on the network based on the analysis results, and (v) generating a second alert to report the second determination to enlarge the scope of the cyber-attack in response to confirming the cyber-attack being directed to multiple of different endpoints within the network.
 12. The non-transitory computer-readable medium of claim 11, wherein the second set of events comprise behaviors exhibited during processing of the second object in a first virtual machine of the analysis system being a malware detection system.
 13. The non-transitory computer-readable medium of claim 12, wherein the security logic engine is a component of the malware detection system.
 14. The non-transitory computer-readable medium of claim 11, wherein the first alert includes one or more indicators associated with the first set of events or the second set of events that are indicative of malware.
 15. The non-transitory computer-readable medium of claim 11, wherein the generating of the first determination further comprises correlating the first set of events and the second set of events with at least events associated with a corpus of known benign objects and the known malicious objects.
 16. The non-transitory computer-readable medium of claim 11, wherein the second alert, generated by the security logic engine, further comprises information to prevent the processing of the first object by each of the plurality of endpoints.
 17. The non-transitory computer-readable medium of claim 11, wherein the software, during execution, determining the scope of the cyber-attack by at least identifying whether an endpoint of the plurality of endpoints having a different software profile than the first endpoint is vulnerable to the cyber-attack by at least: associating each endpoint of the plurality of endpoints, communicatively coupled with the security logic engine, with a software profile; and determining a vulnerability of each endpoint of the plurality of endpoints in response to correlating between a vulnerable software profile and the software profiles of each endpoint.
 18. The non-transitory computer-readable medium of claim 11 including the software that, upon execution within the security logic engine, further conducts analyses comprising: receiving information related to communications between one or more endpoints and the first endpoint having processed the first object; and determining one or more endpoints of the plurality of endpoints other than the first endpoint are at risk of being subject to the cyber-attack.
 19. The non-transitory computer-readable medium of claim 11, wherein the first object includes malware operating equivalently to malware within the second object. 